Matching Automata for Regular Patterns
Authors
Abstract
Pattern matching mechanisms based on regular expressions are featured in a number of recent languages for processing tree-structured data such as XML. A compiler for such a language must address all the familiar problems of pattern optimization in functional languages with ML-style algebraic datatypes and pattern matching, plus some new ones arising principally from the use of recursion in patterns. We propose matching automata as a uniform framework for designing regular pattern compilers and comparing different compilation strategies. Using this framework, we define two pattern compilers, one generating backtracking code and the other a non-backtracking variant, by specifying two sets of structural restrictions on matching automata and giving algorithms (with sketches of correctness proofs) that construct automata of these forms from source-language patterns.
Similar resources
Selective Regular Expression Matching
Signature-based intrusion detection is one of the most commonly used techniques implemented in modern intrusion detection systems (IDS). One of the powerful tools that has gained wide acceptance in IDS signatures over the past several years is the regular expression. However, the performance requirements of traditional methods for matching the incoming events against regular expressions are pr...
A Term Pattern-Match Compiler Inspired by Finite Automata Theory
This paper presents a new algorithm for compiling term pattern-matching for functional languages. Earlier algorithms may produce duplicated code, and redundant or sub-optimal discrimination tests for certain combinations of patterns, in particular when a pattern column contains a mixture of constructors and variables. This algorithm, which was inspired by finite automata theory, addresses these...
Type-Based Optimization for Regular Patterns
Pattern matching mechanisms based on regular expressions feature in a number of recent languages for processing XML. The flexibility of these mechanisms demands novel approaches to the familiar problems of pattern-match compilation—how to minimize the number of tests performed during pattern matching while keeping the size of the output code small. We describe work in progress on a compilation ...
Two-phase Pattern Matching for Regular Expressions in Intrusion Detection Systems
Regular expressions are used to describe security threats’ signatures in network intrusion detection (NID) systems. To identify suspicious packets using regular expression matching, many NID systems use memory-based deterministic finite-state automata (DFA) with one-pass-scanning model, which is fast and allows dynamic updates. However, a number of practical signature patterns commonly found in...
A Novel Algorithm for Pattern Matching Based on Modified Push-Down Automata
In this paper we propose a new algorithm called MEPda (Motif Extraction algorithm based on Push-down automata) to solve the problem of finding patterns containing loops. These loop-patterns or loop-motifs are well known and used in many domains, especially in mathematics and bioinformatics. MEPda is meant to find these kinds of patterns by using pushdown automata as a mechanism of matching process...
OFA: A Scalable Finite Automata-based Pattern-Matching Engine for Out-of-Order Deep Packet Inspection
To match the signatures of malicious traffic across packet boundaries, network-intrusion detection (and prevention) systems (NIDS) typically perform pattern matching after flow reassembly or packet reordering. However, this may lead to the need for large packet buffers, making detection vulnerable to denial-of-service (DoS) attacks, whereby attackers exhaust the buffer capacity by sending long ...
Publication date: 2003